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Novel composition and method for the detection of diseases associated 
with amyloid-like fibril or protein aggregate formation 

The present invention relates to novel compositions useful for elucidating the 
onset or progress of diseases of preferably neuronal origin associated with the 
formation of amyloid-like fibrils or protein aggregates. Further, the present invention 
relates to methods for monitoring said formation as well as to methods for identifying 
inhibitors of said formation. Additionally, the invention relates to inhibitors identified 
by the method of the invention as well as to pharmaceutical compositions comprising 
said inhibitors. 

A variety of diseases, both in humans and animals, is characterized by the 
pathogenic formation of amyloid-like fibrils or protein aggregates in neuronal tissues. 
A well-known and typical example of such diseases is Alzheimer's disease (AD). AD 
is characterized by the formation of neurofibrillar tangles and p-amyloid fibrils in the 
brain of AD patients. Similarly, scrapie is associated with the occurrence of scrapie- 
associated fibrils in brain tissue. 

Another class of these diseases is characterized by an expansion of CAG 
repeats in certain genes. The affected proteins display a corresponding 
polyglutamine expansion. Said diseases are further characterized by a late onset in 
life and a dominant pathway of inheritance. 

A typical representative of this class of diseases is Huntington's disease. 
Huntington's disease (HD) is an autosomal dominant progressive neurodegenerative 
disorder characterized by personality changes, motor impairment and subcortical 
dementia (Harper, 1991). It is associated with a selective neuronal cell death 
occurring primarily in the cortex and striatum (Vonsattel et aL, 1985). The disorder is 
caused by a CAG/polyglutamine (polygln) repeat expansion in the first exon of a 
gene encoding a large -350 kDa protein of unknown function and designated 
huntingtin (HDCRG, 1993). The CAG repeat is highly polymorphic and varies from 



6-39 repeats on chromosomes of unaffected individuals and 35-180 repeats on HD 
chromosomes (Rubinsztein et al., 1996; Sathasivam et al., 1997). The majority of 
adult onset cases have expansions ranging from 40-55 units, whereas expansions of 
70 and above invariably cause the juvenile form of the disease. The normal and 
mutant forms of huntingtin have been shown to be expressed at similar levels in the 
central nervous system and in peripheral tissues (T rattier et al., 1995a). Within the 
brain, huntingtin was found predominantly in neurons and was present in cell bodies, 
dentrites and also in the nerve terminals. Immunohistochemistry, electron 
microscopy and subcellular fractionations have shown that huntingtin is primarily a 
cytosolic protein associated with vesicles and/or microtubules, suggesting that it 
plays a functional role in cytoskeletal anchoring or transport of vesicles (DiFiglia et 
al., 1995; Gutekunst et al., 1995; Sharp et al., 1995) Huntingtin has also been 
detected in the nucleus (de Rooij et al., 1996; Hoogeveen et al., 1993) suggesting 
that transcriptional regulation cannot be ruled out as a possible function of this 
protein. 

In addition to HD, CAG/polygln expansions have been found in at least six 
other inherited neurodegenerative disorders which include: spinal and bulbar 
muscular atrophy (SBMA), dentatorubral pallidoluysian atrophy (DRPLA), and the 
spinocerebellar ataxias (SCA) types 1, 2, 3 and 6 (referenced in Bates et al. 1997). 
The normal and expanded size ranges are comparable with the exception of SCA6 in 
which the expanded alleles are smaller and the mutation is likely to act by a different 
route. However, in all cases the CAG repeat is located within the coding region and 
is translated into a stretch of polygln residues. Although the proteins harbouring the 
polygln sequences are unrelated and mostly of unknown function, it is likely that the 
mutations act through a similar mechanism. Without exception, these proteins are 
widely expressed and generally localized in the cytoplasm. However, despite 
overlapping expression patterns in brain, the neuronal cell death is relatively specific 
and can differ markedly (Ross. 1995), indicating that additional factors are needed to 
convey the specific patterns of neurodegeneration. 

Several investigators have proposed that HD is caused by a toxic gain of 
function, which in turn is caused by abnormal protein-protein interactions related to 
the elongated polygln. It is possible that the binding of a protein to the polygln region 
could either confer a new property on huntingtin or alter its normal interactions 
causing selective cell death either through the specific expression patterns of the 
interacting protein or through the selective vulnerability of certain cells. To date, four 
potential huntingtin-interacting proteins have been isolated: HAP1 (Li et al.. 1995), 
GAPDH (Burke et al., 1996), HIP2 (Kalchman et al., 1996) and HIP-I (Kalchman et 



al M 1997; Wanker et al., 1997). However, it has not been demonstrated whether the 
binding of these proteins to huntingtin is involved in the selective neuropathology. A 
gain of function mechanism has been supported by the identification of an antibody 
that specifically reacts with the pathogenic polygln expansions (Trottier et al., 1995b) 
This indicates that upon expansion into the pathogenic range, a polygln sequence 
may undergo a conformational change. Poly-L-glutamines form pleated sheets of p- 
strands held together by hydrogen bonds between their amides (Perutz et al., 1994). 
It was proposed that the expanded glutamine repeats in huntingtin may function as 
polar zippers, joining protein molecules together (Perutz, 1996). In the long run, this 
could result in the precipitation of huntingtin protein in specific neurons causing the 
observed selective neuronal loss. Thus, the mechanism underlying HD would be 
similar to scrapie, Creutzfeldt-Jakob or Alzheimer's disease, in which p-sheet 
secondary structures lead to the formation of toxic protein aggregates in selective 
neurons (Caughey and Chesebro, 1997). 

Recently, strains of mice (R6) that are transgenic for the HD mutation have 
been generated (Mangiarini et al., 1996). In these mice exon 1 of the human HD 
gene carrying CAG repeat expansions of 115-156 units is expressed under the 
control of the human HD promoter. It has been demonstrated that the transgenic 
animals exhibit a progressive neurological phenotype that exhibits many of the motor 
and non motor features of HD. The phenotype includes a resting tremor; irregular 
gait; rapid, abrupt shuddering movements; stereotypic grooming movements and 
epileptic seizures. Coincident with the onset of the movement disorder the mice 
show a progressive weight loss. Neuropathological analysis has shown a reduction 
in brain weight (which precedes that in body weight) and the presence of neuronal 
intranuclear inclusions (Nils) predating any evidence of neuronal dysfunction (Davies 
et al., 1997). The Nils are immunoreactive for N-terminal huntingtin antibodies that 
detect the transgene protein and for ubiquitin but do not contain the endogenous 
mouse huntingtin. At the ultrastructural level, a solitary intranuclear inclusion 
appears as a roughly circular pale structure of a fine granular nature with occasional 
filamentous structures and devoid of a membrane. In addition, the neurons invariably 
have indentations of the nuclear membrane and an apparent increase in the density 
and clustering of nuclear pores. All three of these ultrastructural nuclear changes 
have previously been reported in EM studies from HD patients (Roizin et al., 1979; 
Roos and Bots, 1983; Tellez-Nagel et al., 1974). 

Thus, a large body of data has accumulated that describes aspects of the 
pathology of the above-discussed diseases. However, the actual mechanisms 
leading to the onset of the various disease states are still unknown. Although a 



variety of hypotheses have been formulated in the art, it is equally unknown how the 
amyloid or aggregate formation is triggered or caused within affected cells or tissues. 
Without a detailed knowledge of the formation of said aggregates, the development 
of a suitable pharmaceutical composition for treating such diseases appears rather 
difficult. The technical problem underlying the present invention was therefore to 
provide means and methods suitable for the eventual elucidation of the etiology of 
these diseases and the development of appropriate medicines. 

The above technical problem is solved by the embodiment characterized in the 
claims. 

Accordingly, the present invention relates to a composition comprising 

(a) a nucleic acid molecule encoding a fusion protein comprising 

(aa) a (poly)peptide that enhances solubility and/or prevents aggregation of said 
fusion protein; and 

(ab) an amyloidogenic (poly)peptide that" has the ability to self-assemble into 
amyloid-like fibrils or protein aggregates; 

(b) a vector containing the nucleic acid molecule of (a); 

(c) a host transformed with the vector of (b); 

(d) a fusion protein encoded by the nucleic acid of (a) or a functional derivative 
thereof; and/or 

(e) an antibody specific for the fusion protein of (d). 

As used herein, the term "(poly)peptide" relates to a polypeptide or a peptide 
depending on the length of the amino acid string. Said (poly)peptide has the ability to 
enhance solubility of a fusion partner in said fusion protein and thus of the fusion 
protein itself. Additionally, or alternatively, said (poly)peptide prevents the 
aggregation of the fusion partner and thus of the fusion protein. Said (poly)peptide is 
combined within said fusion protein with an amyloidogenic (poly)peptide having the 
above recited features. The connection of both (poly)peptides may be via a linker or 
by a direct attachment. It is preferred that either the linker or either (poly)peptide 
comprises a cleavable site. Said cleavable site should render both (poly)peptides 
essentially intact'. Alternatively, said fusion protein may comprise a number of 
cleavage sites. Upon cleavage, which may be exhaustive or under limiting 
conditions, the amyloidogenic (poly)peptide should, when used for the purposes of 
the present invention, retain the ability to self-assemble. The person skilled in the art 
is in the position to determine appropriate conditions for a corresponding limited 
cleavage. 



The term "functional derivative" refers to a fusion protein which comprises, for 
example, modified amino acids or amino acid substitutions and retains the functions 
of the fusion protein detailed herein above. 

The term "antibody specific for the fusion protein" comprised in the composition 
of the invention is intended to mean that said antibody is only specific for the fusion 
protein but not for either of the above cited components of said fusion protein. 

In accordance with the present invention, it could surprisingly be shown that the 
composition comprising the above recited components can be used for the 
elucidation of amyloid-like fibril or protein aggregate formation. The components of 
the composition can be used in varying combinations to test, for example, for specific 
conditions under which amyloids are formed in vitro. The in vitro data obtained with 
the composition of the invention may then be compared to or brought into relation 
with the in vivo situation and appropriate conclusions may be drawn therefrom. 

The in vitro systems that can be established with the composition of the 
invention allow formation of highly stable amyloid-like protein aggregates. Such 
aggregates may be obtained, for example, by proteolytic cleavage of GST fusion 
proteins comprising exon 1 of the HD gene and containing expanded polygln 
sequences. Alternatively, such aggregates may be obtained by lowering the pH 
value from 8 to 5 or by increasing the protein concentration. The arrays of fibrillar 
structures of varying sizes and shapes observed by electron microscopy surprisingly 
clearly resemble those of purified amyloids. Furthermore, the polarization 
microscopic properties of the fibrils stained with Congo red are strikingly similar to 
those described for amyloids. The green-gold birefringence of the amyloid-like fibrils 
indicates that the polymers have common structural features. Although the Congo 
red staining does not determine conclusively whether the fibrils consist of p-pleated 
sheets, the method suggests that this is likely in view of experience gained with other 
protein polymers (Caputo et al., 1992)/ However, it has been generally accepted that 
naturally occurring mammalian protein polymers that exhibit fibrillar structures and 
green birefringence after Congo red staining should be classified as amyloids 
(Glenner, 1980). 'instead of the HD gene, other nucleic acid sequences encoding 
amyloidogenic (poly)peptides may be used to generate said fusion proteins. 
Preferably, the composition of the invention is a diagnostic composition. The 
composition of the invention may also be a kit. The diagnostic composition can 
advantageously be employed in the assessment of a disease state whereas the kit 
may rather be employed in the development of, for example, inhibitors or in the 
elucidation of amyloid formation. 
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It is a preferred embodiment of the composition of the invention, that said 
amyloidogenic (poly)peptide comprises a polyglutamine expansion. In the prior art it 
has been shown by X-ray diffraction studies that synthetic peptides containing 
polyglns form p-sheets strongly held together by hydrogen bonds (Perutz et al., 
1994). Because synthetic poly(L-glutamine) is insoluble in water, a synthetic peptide 
with the sequence Asp2-Gln-|5-Lys2 was used in that study. A stretch of 10 
glutamines was also inserted into the loop of chymotrypsin inhibitor-2 (CI2) t and it 
was demonstrated by analytical ultracentrifugation that the recombinant protein, in 
addition to monomers formed dimers and trimers, whereas wild-type CI2 was present 
only in the monomeric form (Stott et al., 1995). It has been proposed that the polygln 
stretch functions as a polar zipper, joining proteins together. However, the 
hypothesis that glutamine repeats in proteins form p-pleated sheets and induce 
protein aggregation by a mechanism similar to that observed in spongiform 
encephalopathy (TSE) diseases (Caughey and Chesebro, 1997) could not be proven 
with this recombinant protein. Most likely, the length of the polygln sequence inserted 
into CI2 was too short. Accordingly, the experimental data actually obtained teach 
away from the above recited hypothesis. 

In a particularly preferred embodiment said polyglutamine expansion comprises 
at least 35 glutamines. In a further particularly preferred embodiment said 
polyglutamine expansion comprises at least 51 glutamines. 

Our studies with the GST-HD fusion proteins containing polygln sequences of 
varying lengths demonstrate that a certain length of the polygln stretch is necessary 
for the formation of amyloid-like fibrils in vitro. When the purified fusion proteins were 
analyzed by SDS-PAGE, insoluble high molecular weight protein aggregates were 
only detected with the proteins containing 81 and 122 glutamines (Fig. 1a and b), 
whereas the protein with 51 glutamines was soluble and no fibrillar structures were 
detected by electron microscopy (Fig. 4a). This indicates that the critical length of 
the polygln stretch in the fusion proteins leading to the formation of aggregates is 
greater than 51 glutamines. Accordingly, fusion proteins of the invention with a 
polyglutamine expansion of more than 51 glutamines, such as 81 or 122 glutamines, 
may be employed in studies for the formation of aggregates that render a cleavage 
reaction unnecessary. However, when the GST-tag, which is known to enhance the 
solubility of many proteins (Smith and Johnson, 1988), was cleaved by limited 
digestion with trypsin, the liberated HD exon 1 protein with 51 glutamines also 
started to form aggregates (Fig. 3a) and the amount of these aggregates increased 
when the GST-tag was totally degraded with trypsin (Fig. 3b). This indicates that in 



the HD exon 1 protein 51 glutamines are sufficient to form aggregates, whereas 20 
and 30 glutamines under the same conditions are not. The minimum critical length 
essential for the development of amyloid-like structures after removal of the GST-tag 
is not known and has to be determined. However, preliminary experiments in our 
laboratory suggest that the threshold for the formation of HD exon 1 protein 
aggregates is between 35 - 48 glutamines. This result is strikingly similar to the 
pathological threshold in HD, SBMA, DRPLA, SCA1 , SCA2 and SCA3. In all of these 
neurodegenerative polygln diseases a pathological phenotype was found when more 
than 41 repeats were present, suggesting that the elongation of the polygln repeat 
beyond a certain length may lead to a phase change in the affected proteins. This 
could, for example, be a change from random coils to hydrogen-bonded hairpins in 
the polygln stretch, see Perutz (1996). 

With the understanding that the applicant is not bound by any scientific theory, 
a mechanism is proposed for the fibril formation induced by proteolytic cleavage of 
GST-HD51 as shown in Fig. 7. Based on the known crystal structure of GST with a 
C-terminal fusion peptide (Lim et ah, 1994) and the fact that the purified GST protein 
is a dimer we suppose that native GST-HD51 exists as a dimer with two expanded 
polygln sequences which form stable hairpins consisting of antiparallel p-strands 
strongly held together by hydrogen bonds between the main chain and the side 
chain amides. In the native protein both hairpins are tightly bound to the surface of 
GST and not accessible for protein-protein interactions with other polygln sequences. 
As a result of the cleavage with a site-specific protease, both hairpins become 
accessible and p-sheets with hairpins from other cleaved protein molecules are 
formed. This transient population of intermediates consisting of GST molecules and 
hairpins leads to the formation of polygln- containing p-sheet fibrils and free GST 
molecules. This model is supported by the finding of potential intermediate structures 
present on one or both ends of the growing fibrils (Fig. 4c and d). These clots of 
varying sizes were not detected when GST-HD51 was digested to completion with 
trypsin, which totally degrades the GST-tag, whilst they were detectable upon limited 
digestion, leaving the GST moiety largely intact (Fig. 4d). This indicates that these 
structures are transient intermediates. 

A model of the formation of amyloid-like fibrils via transient intermediates is not 
without precedent. Booth et al. (1997) have shown that amyloidogenic lysosome 
variants aggregate on heating, unlike the wild-type protein, and that the lysozyme 
fibrils are formed from potential precursor proteins. It is possible that the transient 
GST-HD intermediates function as nuclei for ordered protein aggregation, very 
similarly to protein crystallization and microtubule formation, which are nucleation- 
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dependent polymerisations (Jarrett and Lansburry, 1993). Once a nucleus is formed, 
the further addition of monomers becomes thermodynamically favorable and results 
in rapid polymerization. An important feature of a nucleation-dependent process is a 
lag time before the aggregates are detectable. During this period, dinners and trimers 
are formed. Fig. 3a shows that during proteolytic cleavage of GST-HD51 dimers of 
the released HD portion are formed, the concentration of which then decreases upon 
prolonged incubation concomitant with an increase in the formation of large protein 
aggregates (Fig. 3b). Although additional kinetic studies will be necessary to prove 
this assumption, preliminary results in our laboratory suggest that a "one- 
dimensional" crystallization leads to the formation of in vitro amyloid-like huntingtin 
aggregates. 

Accordingly, the present invention provides both the possibilities to analyze 
aggregation of amyloid-like aggregates using defined cleavage conditions or using 
fusion proteins comprising long polyglutamine stretches that render the cleavage 
unnecessary. Whereas the cleavage of the" fusion protein enables to set a distinct 
starting point of the reaction and therefore of the aggregate formation, the second 
alternative has the advantage that the use of a cleaving agent is rendered obsolete. 

In a further particularly preferred embodiment of the invention said (poly)peptide 
defined in (ab) is huntingtin, androgen receptor, atropin, TATA binding protein, or 
ataxin-1 ,-2,-3 or -6 or a fragment or derivative thereof. 

The fibrillar structures formed by proteolytic cleavage of purified GST-HD51 in 
vitro and also in the brains of mice transgenic for the HD mutation are very similar to 
structures detected in brain sections or purified protein fractions of Alzheimer's 
disease (AD), Creutzfeldt-Jakob disease (CJD), Gerstmann-Strassler-Scheinker 
syndrome (GSS), fatal familial isomnia (FFI), kuru, bovine spongiform 
encephalopathy (BSE) and scrapie (Caughey and Chesebro, 1997). In all these 
disorders, the accumulation of amyloid-like fibrils in the central nervous system is 
accompanied by loss of nerve cells and a neuropathological phenotype. However, 
the molecular basis of these diseases is not known. For the first time, our results 
raise the possibility that HD, DRPLA, SBMA, SCA1, SCA2, and SCA3 are also the 
result of a toxic amyloid fibrillogenesis. Although the detection of amyloid-like fibrils 
has not previously been reported in these inherited diseases, our results strongly 
suggest that polygln-containing polymers are also formed in vivo by their detection in 
a transgenic model of polyglutamine disease. The high molecular weight aggregates 
were exclusively detected in the nuclear fraction prepared from transgene brain 
material, which is in good agreement with the results of Davies et al. (1997), who 



demonstrated the presence of the transgene protein and ubiquitin in neuronal 
intranuclear inclusions, from a time prior to the development of a neurological 
phenotype. Strikingly, ultrastuctural analysis has shown similar intranuclear 
inclusions to be present in the cortical and striatal biopsy material from HD patients 
(Roizin et al., 1979) some of which showed clear evidence of intranuclear fibrils of up 
to 1 pm in length. Preliminary experiments with nuclear protein fractions prepared 
from HD brain material indicated that insoluble huntingtin aggregates are indeed 
present in these fractions. However, additional control experiments have to be 
performed to substantiate these results. 

One possible explanation for the absence of detection of high molecular weight 
huntingtin protein aggregates in HD brains could be that the aggregates consist 
mainly of polygln-containing peptides which have been cleaved from the full length 
protein. In such a case, only an antibody raised against an N-terminal huntingtin 
fragment, containing the polygln sequence, would be able to detect the aggregates 
in the nucleus. In most of the previous immunohistochemical studies, antibodies 
raised against the central or C-terminal portion of huntingtin have been used, which 
detect the full length protein (350 kDa) in the cytosol and in the membrane containing 
fractions (DiFiglia et al., 1995; Sharp et al., 1995; Trottier et al., 1995a). However, 
antibodies raised against peptides and fusion proteins from the N- and C-terminus of 
huntingtin also detected the protein in the nucleus (de Rooij et al., 1996; Hoogeveen 
et al., 1993), indicating that huntingtin is also present in this subcellular 
compartment. There are several lines of evidence to implicate a shorter polygln- 
containing peptide/protein fragment of huntingtin in the pathology of HD. Ikeda et al. 
(1996) showed that a short fragment of the MJD1 protein containing 79 polyglns 
(Q79C) but not the full length protein with the elongated repeat induced apoptotic cell 
death in COS cells. The polygln-containing protein fragment migrated in SDS-gels at 
a position much higher than expected from its molecular weight, even after boiling in 
the presence of 2% SDS. These results are in good agreement with our data 
obtained using the GST-HD fusion proteins containing elongated polygln sequences. 
Fig. 1a and b show that the expression of GST-HD83 and GST-HD122 in E. coli was 
dramatically reduced compared to the fusion proteins containing 20-51 repeats, and 
additional studies have' indicated that the elongated glutamines are toxic for E. coli 
cells. Our results confirm the experiments of Onodera et al. (1996) who determined 
that GST fusion proteins with 59-81 glutamines inhibit both cell growth and protein 
synthesis in E. coli, whereas the expression of proteins with less than 35-40 
glutamines had little or no effect. 
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The possibility that polygln-containing cleavage products of huntingtin cause 
neurodegeneration in HD is substantiated by the finding of Goldberg et al. (1996) 
who showed that an N-terminal 80 kDa huntingtin fragment is cleaved from the full 
length protein by apopain, a proapoptotic cysteine protease. This indicates that the 
N-terminus of huntingtin is primarily accessible for proteases and distinct proteolytic 
cleavage products can be formed in vitro and in vivo. In addition, there is strong 
evidence that the mutated huntingtin somehow induces apoptotic cell death in HD, 
but the underlying molecular mechanism is not known (Duyao et al., 1995; Portera- 
Cailliau et al., 1995). Our data suggest that a proteolytic cleavage product of 
huntingtin, which is transported into the nucleus by an unknown mechanism, causes 
selective neuronal cell death by the formation of insoluble amyloid-like fibrils. It is 
possible that the transport to the nucleus is facilitated by a specific nuclear transport 
mechanism which is unique to certain neuronal cells and involves abnormal protein- 
protein interactions related to the elongated polygln. Alternatively, there may be 
specific nuclear proteins in the affected neurons which enhance the huntingtin 
protein aggregation. 

Recently, the formation of neuronal intranuclear inclusions in mice transgenic 
for the SCA1 mutation have been detected, indicating that polygln-containing 
polymers are also formed in spinocerebellar ataxia type 1. However, additional 
experiments will be necessary to confirm these results. 

In an additional preferred embodiment of the composition of the present 
invention, said amyloidogenic (poly)peptide self-assembles subsequent to release 
from said fusion protein. 

As has been pointed out herein before, the self-assembly of said (poly)peptides 
only subsequent to the release from the fusion protein provides the advantage that 
an exact time point of the start of the formation can be set. This has a number of 
advantages. For example, inhibitors of aggregate formation can be tested as regards 
their efficacy as a function of time. In addition, obtainment of data is facilitated in 
view of the fact that for amyloid formation a premix may be set up to which only the 
cleaving agent most be added. 

In a further preferred embodiment of the composition of the invention, said 
amyloidogenic (poly)peptide is the amyloid precursor protein (APP), p-protein, an 
immunoglobulin light chain, serum amyloid A, transthyretin, cystatin C, (32- 
microglobuiin, apolipoprotein A-1, gelsoline, islet amyloid polypeptide (IAPP), 



calcitonin, a prion, atrial natriuretic factor (ANF), lysozyme, insulin, fibrinogen, or a- 
synuclein or a fragment or derivative thereof. 

Deposits of p-amyloid in neuritic plaques and blood vessel walls are the 
principal pathological feature in the brains of patients with Alzheimer's disease. 
These amyloid deposits contain the 39-43 amino acid p-amyloid peptide which is 
derived by proteolytic cleavage from the larger precursor, the amyloid precursor 
protein (APP). There is strong evidence that the formation and aggregation of p- 
amyloids into fibrils is the primary pathogenic event leading to amyloid deposition in 
Alzheimer's disease. 

An additional preferred embodiment relates to a composition wherein said 
(poly)peptide defined in (aa) is glutathione s-transferase (GST), intein, thioredoxin, 
dihydrofolate reductase (DHFR) or chymotrypsin inhibitor 2 (CI2) or a functional 
fragment or derivative thereof. 

All of these (poly)peptides may be advantageously used to enhance, the 
solubility and/or prevent aggregation of the fusion proteins of the invention. 
Particularly preferred is to employ intein in said composition because the protein has 
been modified such that it undergoes a self-cleavage reaction at its N-terminus at 
low temperatures in the presence ot thiols such as DTT. Also comprised by this 
embodiment are functional fragments of any of these above recited proteins. The 
term "functional fragment" as used herein is intended to denote the capability of said 
fragment to confer solubility or prevent aggregation. 

Further preferred is that the nucleic acid contained in the composition of the 
invention is DNA. Particularly preferred is that said DNA is cDNA, synthetic DNA or 
(semi)synthetic DNA. 

Also preferred is that the vector that may be contained in the composition of the 
invention is an expression vector or a gene targeting vector. These vectors may 
advantageously be used for transfecting hosts that may or may not be contained in 
the composition of the invention. Said hosts may then be used for the production of 
the fusion protein comprised in the composition of the invention. 

Preferably, said host is a bacterial, preferably an E.coli, an animal-, preferably a 
mammalian, an insect-, a plant-, a fungal, preferably a yeast- and most preferably a 
Saccharomyces or Aspergillus cell, a Pichia pastoris cell, a transgenic animal or a 
transgenic plant. 



The present invention also relates to a method of producing a fusion protein as 
defined in the diagnostic composition of any of the preceding claims comprising 
culturing or raising the host as defined in claim 11 and isolating said fusion protein. 
Preferably the bacterial host E. coli is used for the expression of the GST-HD fusion 
proteins. The cDNA fragments containing CAG repeats in the normal or pathological 
range may, for example, be cloned into pGEX-5X-1 (Pharmacia) and the resulting 
plasmids expressing fusion proteins with polygln-sequences used for protein 
purification. The resulting proteins may be purified under native conditions by affinity 
chromatography on glutathione agarose (Smith and Johnson, 1988). 

Additionally, the present invention relates in a preferred embodiment to a 
composition wherein said antibody is a monoclonal antibody, polyclonal antibody, 
phage display antibody or a fragment or derivative thereof. 

The above recited fragments or derivatives comprise Fv-, Fab-, F(ab)' 2 - 
fragments, single-chain antibodies or single-chain antibody domains. These 
antibodies may advantageously be used in experiments such as Western blotting 
experiments to determine the presence of the protein on, for example, nitrocellulose 
membrane. 

Alternatively, the antibody, derivative or fragment thereof may be used in 
immunometric assays such as ELISAs or RIAs or may be coupled to columns in 
order to retain the fusion protein from, for example, mammalian sources on said 
column for further detection or purification. 

Alternative uses and advantages of the antibody or fragment or derivative 
contained in the composition of the invention are, on the basis of the teachings of 
this invention, clear to the person skilled in the art. 

The invention further relates to an in vitro method of producing amyloid 
aggregates comprising (a) at least partially cleaving the fusion protein comprised in 
the diagnostic composition of the invention wherein the (poly)peptide that is released 
has the ability to self-assemble into amyloid-like fibrils or protein aggregates or (b) 
inducing self-assembly into amyloid-like fibrils or protein aggregates by heating the 
fusion protein comprised in the composition of any one of claims 1 to 13 or an 
amyloidogenic (poly)peptide that has the ability to self-assemble into amyloid-like 
fibrils or protein aggregates. 
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The method of the invention may advantageously be used to study in more 
detail the process that leads to the formation of amyloid-like fibrils or protein 
aggregates from amyloidogenic (poly)peptides. Using the method of the invention, 
the onset of, for example, HD or AD can be examined in an in vitro situation. It is 
important that the polypeptide that is released by the cleaving agent still retains the 
possibility to form amyloid-like fibrils or protein aggregates. The formation of such 
fibrils or aggregates may be monitored, for example, by electron or light microscopy. 
Varying the cleaving conditions in cases where more than one cleavage site is 
present on the fusion protein may be used to further elaborate the minimal 
requirements for said amyloid-like fibril or protein aggregate formation. 

Preferably, the cleavage is effected chemically or enzymatically, or by the intein 
self-cleavage reaction in the presence of thiols. 

In the following, enzymatic cleavage will also be referred to as proteolytic 
cleavage. The enzymatic cleavage has the advantage that the cleavage reaction can 
be performed under almost physiological conditions and normally only low amounts 
of the protease are necessary for the cleavage reaction. The disadvantage of the 
enzymatic cleavage reaction is that prospective inhibitors might inhibit the protease 
and in turn prevent the formation of protein aggregates. In comparison, this is not the 
case when the cleavage reaction is performed chemically. 

Further, the present invention relates to a method of testing a prospective 
inhibitor of aggregate formation of a fusion protein as defined in the composition 'of 
the invention when enzymatically or chemically cleaved or a non-cleaved fusion 
amyloidogenic (poly)peptide as defined in any one of claims 1 to 13 or an 
amyloidogenic non-fusion (poly)peptide comprising 

(a) incubating in the presence of a prospective inhibitor 

(aa) said fusion protein in the presence or absence of a cleaving agent; or 

(ab) said non-fusion poly(peptide); and 

(b) assessing the formation of amyloid-like fibrils or protein aggregates. 

This method of the present invention provides a particularly strong impact on 
the pharmaceutical research related to amyloid-associated diseases. For the first- 
time, an inhibitor of fibril or aggregate formation can conveniently, directly, easily and 
within a short time be tested in vitro. As has been detailed herein above, aggregate 
formation may be tested on cleavage products, on non-cleaved fusion proteins or on 
the above recited non-fusion proteins which have the capacity to aggregate when the 
temperature is raised, the pH is lowered or the protein is dissolved in urea and the 



urea is slowly diluted out with a solvent. Additionally, the present invention does not 
exclude self-assembly under different conditions. 

It was shown recently that acid-mediated denaturation of, e.g., transthyretin 
yields a conformational intermediate that can self-assemble into amyloid (Lai et al. f 
1996). Booth et al. demonstrated that heat denaturation of human lysozyme variants 
resulted in instability, unfolding, and amyloid fibrillogenesis. 

Preferably, the incubation is effected in the presence of factor Xa, trypsin, 
endoproteinase Arg-C, endoproteinase Lys-C, proteinase K or elastase at a 
temperature of preferably 25 to 37°C for 0,5 to 16 hours and the assessment of the 
formation of fibrils or aggregates in step (b) is preferably effected by a filter assay or 
by a thioflavine T (ThT) fluorescence assay, in which the fluorescence intensity 
reflects the degree of aggregation. 

As regards the filter assay, a more detailed protocol thereof is explained in the 
European patent application entitled "Novel method of detecting amyloid-like fibrils or 
protein aggregates" filed on the same day with the European Patent Office and 
assigned to the same applicant which is explicitly incorporated herein by reference. 

Additionally, the present invention relates to a method for identifying an inhibitor 
of aggregate formation of a fusion protein as defined in the invention prior to or after 
proteolytic or chemical cleavage or of a non-fusion amyloidogenic (poly)peptide as 
described herein above comprising 

(a) loading a surface or gel with said protein or an aggregate thereof; 

(b) incubating said surface or gel with a prospective inhibitor; and 

(c) assessing whether the presence of said prospective inhibitor avoids or reduces 
aggregate formation or further aggregate formation. 

In accordance with this embodiment of the invention, proteolytic or chemical 
cleavage can be advantageously effected either prior or after the loading of the 
surface or gel leaving the investigator additional degrees of freedom in devicing his 
experiments. The' method of the invention is both useful for investigating the onset of 
aggregate or fibril formation or assessing the progression of such a process starting 
from the already existing aggregate or fibril. The latter embodiment is particularly 
useful in investigating treatment conditions for patients that are already affected by 
the disease at an early or medium stage thereof. 
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There is strong evidence that the formation of amyloids is a nucleation- 
dependent polymerization similar to protein crystallization or microtubule assembly. 
However, the deposition of a monomer onto a preexisting amyloid template is 
independent of the nucleation process. Thus, it will be very important to study the 
deposition of monomers onto a defined template under physiological conditions. With 
our in vitro system we should be able to monitor the deposition of radiolabeled 
polygln-containing monomers. 

Preferably, said surface employed in the method of the invention is a 
membrane. Preferably, the membrane should be cellulose acetate and should have 
a low binding capacity for soluble proteins. 

The invention also relates to an inhibitor identifiable or identified by the method 
of the invention. 

The various methods described herein above will give rise to the isolation of a 
number of inhibitors which are also comprised by the present invention. Once such 
an inhibitor is known, it is of course not necessary to identify it again by the method 
of the invention. Rather, said inhibitor can be produced by chemical or recombinant 
means. In the case that the inhibitor is of proteinaceous material, it is preferred to 
resynthesize a compound having the or most of the characteristics of said inhibitor 
by peptidomimetics. 

p re f era b|y, a number of compounds or compound classes are tested for their 
efficacy to inhibit amyloid-like fibril or protein aggregate formation. Said compounds 
comprise an antibody, 4'-lodo-4'-deoxydoxorubicin (IDOX), pyronine Y, guanidine 
hydrochloride, urea, rifampicin and derivatives thereof, myristyltrimethylammonium 
bromide, hydroquinone, p-benzoquinone, 1 ,4-dihydroxynaphthalene, p- 
methoxyphenol, a-tocopherol, ascorbic acid, p-carotene, anthracycline, doxorubicin, 
hexadecyl-N-methylpiperidinium, dodecyltrimethyl-ammonium, N-tetradecyl-N,N- 
dimethyl-3-ammonio-1-propanesulfonate, a (poly)peptide, glutamine or an 
oligoglutamine peptide. 

The present invention also relates to a pharmaceutical composition comprising 
the inhibitor of the invention and a pharmaceutical^ acceptable carrier and/or diluent. 

The pharmaceutical composition of the invention will find wide applicability in 
the medical field. Essentially all diseases associated with protein aggregate 
formation or amyloid-like fibril formation, in particular if they are associated with 



neuronal tissue or cells, may be effectively treated with the pharmaceutical 
composition of the invention. 

Examples of suitable pharmaceutical carriers are well known in the art and 
include phosphate buffered saline solutions, water, emulsions, such as oil/water 
emulsions, various types of wetting agents, sterile solutions etc. Compositions 
comprising such carriers can be formulated by well known conventional methods. 
These pharmaceutical compositions can be administered to the subject at a suitable 
dose. Administration of the suitable compositions may be effected by different ways, 
e.g., by intravenous, intraperitoneal, subcutaneous, intramuscular, topical or 
intradermal administration. The dosage regimen will be determined by the attending 
physician and other clinical factors. As is well known in the medical arts, dosages for 
any one patient depends upon many factors, including the patient's size, body 
surface area, age, the particular compound to be administered, sex, time and route 
of administration, general health, and other drugs being administered concurrently. 
The compositions of the invention may be administered locally or systemically. 
Administration will generally be parenterally, e.g., intravenously; DNA may also be 
administered directly to the target site, e.g., by biolistic delivery to an internal or 
external target site or by catheter to a site in an artery. 

The therapeutically useful compounds identified according to the method of the 
invention may be administered to a patient by any appropriate method for the 
particular compound, e.g., orally, intravenously, parenterally, transdermally, 
transmucosally, or by surgery or implantation (e.g., with the compound being in the 
form of a solid or semi-solid biologically compatible and resorbable matrix) at or near 
the site where the effect of the compound is desired. 

Finally, the present invention relates to a transgenic mammal or plant 
comprising a nucleic acid molecule or vector as described in the invention. The 
transgenic mammal or plant would be advantageously be used for the in vivo testing 
of the efficacy of the inhibitors referred to above. With the pharmaceutical 
composition of the invention the formation of protein aggregates in the brain or other 
tissues of a transgenic animal can be monitored quantitatively. Furthermore, in vivo 
studies relating to the onset or progress of the above recited diseases may be 
carried out. 



The figures show: 



Figure 1 

SDS-PAGE Analysis of Purified GST and GST-HD Fusion Proteins. 

(a) Aliquots (15 ml) of eluates from the glutathione agarose column were 
subjected to 12.5 % SDS-PAGE and analyzed by staining with Coomassie blue R. 
Lanes 1-6 contain GST, GST-HD20, -HD30, -HD83 and -HD122, respectively; lane 
M contains molecular mass standards, (b) Proteins were transferred to nitrocellulose 
and probed with anti-HD1 antibody. Arrows mark the origin of electrophoresis. 

Figure 2 

Structure of GST-HD fusion proteins. 

The amino acid sequence corresponding to exon 1 of huntingtin is boxed. 
Arrows labeled Xa and T indicate cleavage sites for factor Xa and trypsin, 
respectively. 

Figure 3 

Site-Specific Proteolysis of GST-HD Fusion Proteins with Trypsin and Factor 
Xa. 

Tryptic digestions were performed at 37°C for 3 (a) or 16 h (b). Native proteins 
and their cleavage products were subjected to 12.5% SDS-PAGE, blotted onto 
nitrocellulose membranes, and probed with anti-HD1 antibody. Arrows mark the 
origin of electrophoresis, (c) Purified fusion proteins and their factor Xa and trypsin 
cleavage products were analyzed using the filter retardation assay. The proteins 
retained by the cellulose acetate and nitrocellulose membranes were detected by 
incubation with the anti-HD1 antibody. 

Figure 4 

Electron Micrographs of Native GST-HD Fusion Proteins and their Factor Xa 
and Trypsin Cleavage Products. 

Purified GST fusion proteins were protease treated, negatively stained with 
uranyl acetate and viewed by electron microscopy. The undigested GST-HD51 
molecules appear as a homogeneous population of small, round particles (a). 
Removal of the GST-tag with factor Xa results in the formation of amyloid-like fibrils 
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and intermediate structures (b + c). After partial digestion (3 h) of GST-HD51 with 
trypsin, the ribbons are associated with terminal clots (d, arrow), whereas prolonged 
digestion (16 h) produces ribbons without attached clots (e). Removal of the GST-tag 
from GST-HD20 shows no evidence for the formation of defined structures (f). 

Figure 5 

Birefringence of Protein Aggregates Formed by Proteolytic Cleavage of GST- 
HDSL 

The protein aggregates were stained with Congo red. (a) Bright field, 200x; (b) 
Polarized light, 200x; (c) Polarized light, 100x. 

Figure 6 

Polygln-Containing Protein Aggregates are Formed in vivo. 

(a) Western blot analysis, after separation by 10% SDS-PAGE, of the nuclear 
(N) and cytosolic (C) protein fractions prepared from brain and kidney of an R6/2 
hemizygous transgenic mouse and a littermate control. Blots were probed with anti- 
HD1, anti-GAPDH and anti-Fos B antibodies as indicated, (b) Detection of HD exon 
1 protein aggregates formed in vivo using the cellulose acetate filter assay. The 
membrane was immunostained using the anti-HD1 antibody, (c) Ultrastructure of a 
neuronal intranuclear inclusion (Nil). The presence of a Nil in a striatal neuron of a 
17 month old R6/5 homozygous mouse is shown. The Nil is indicated by the large 
arrow and the fibrillar amyloid-like structures within the Nil are indicated by two small 
arrows. The scale bar is 250 nm. 

Figure 7 

Proposed Mechanism for the Formation of Amyloid-like Fibrils by Proteolytic 
Cleavage of GST-HD51 . 

GST-HD51 molecules are represented as follows: zigzag line, elongated 
polygln repeat forming a stable hairpin with p-sheet structure; dotted line, N-terminal 
amino acids in the HD exon 1 protein containing the factor Xa cleavage site (arrow) 
(undefined structure); thin line, C-terminal amino acids in the HD exon 1 protein 
(undefined structure); shaded symbol, the dimeric globule-like form of GST. 

Prior to cleavage, the HD exon 1 protein is tightly bound to the GST-tag 
preventing intermolecular interactions (1). Removal of the GST-tag with factor Xa 
renders the polygln repeat accessible allowing the formation of fibrils as seen by EM 
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(3). During cleavage, intermediate structures form through specific polygln 
interactions before complete release of the GST-tag has occurred (2). These 
intermediates appear as clots under EM and are frequently seen at the terminals of 
growing fibrils. 

The examples illustrate the invention: 
Example 1: 

Purification of GST-HD fusion proteins containing expanded polyglns 

Exon 1 of the HD gene was isolated from genomic phage clones, derived from 
the normal and expanded alleles of an HD patient (Sathasivam et al., 1997), and 
used for the expression of GST-HD fusion proteins in E. coli. DNA fragments 
containing CAG repeats in the normal (CAG)20-33 and expanded (CAG)37_-|3o 
range were cloned into pGEX-5X-1 (Pharmacia), and the resulting plasmids 
expressing fusion proteins with 20 (GST-HD20), 30 (-HD30), 51 (-HD51), 83 (-HD83) 
and 122 (-HD122) glutamines, respectively, were used for protein purification. .For 
plasmid construction lambda phage from stock 91974 (Sathasivam et al., 1997) were 
plated to give single plaques which were innoculated into 400 ml cultures of E. coli 
XL1-Blue MRF' (Stratagene) for DNA preparation. The DNA sequence encoding the 
N-terminal portion of huntingtin (exon 1), including the CAG repeats, was amplified 
by PCR using the following pair of primers: ES 25 
(TG GGATCC GCATGGCGACCCTGGAAAAGCTGATGAAGG) corresponding to 
nt31 5-343 of the HD gene (HDCRG, 1993) and containing a BamHI site (underlined) 
and ES 26 (GGA GTCGAC TCACGGTCGGTGCAGCGGCTCCTCAGC) 
corresponding to nt51 6-588 and containing a Sail site (underlined). Conditions for 
PCR were as described (Mangiarini et al. 1996). Due to instability of the CAG repeat 
during propagation in £. coli , DNA preparations from individual plaques yielded 
different sized PCR products. Fragments of ~ 320, 360, 480, and 590 bp were gel- 
purified, digested with BamHI and Sail and inserted into the BamHI-Sall site of the 
expression vector pGEX-5X-1 (Pharmacia), yielding pCAG30, pCAG51, pCAG83 
and pCAG122, respectively. pCAG20, containing 20 repeats of CAG within the 
cloned HD exon 1 sequence, was similarly constructed from a phage genomic clone 
derived from a normal allele. All constructs were- verified by sequencing. After 
induction with IPTG, the resulting proteins were purified under native conditions by 
affinity chromatography on glutathione agarose. Thus, E. coli SCSI (Stratagene) 
carrying the pGEX expression plasmid of interest was grown to an OD600nm of °- 6 
and induced with IPTG (1 mM) for 3.5 h as described in the manufacturer's protocol 
(Pharmacia). Cultures (200 ml) of induced bacteria were centrifuged at 4000 g for 20 
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min, and the resulting pellets were stored at -80°C. Cells were thawed on ice and 
resuspended in 5 ml of lysis buffer (50 mM sodium phosphate, 150 mM NaCI, 1 mM 
EDTA, pH 7.4) containing 0.5 mg/ml lysozyme. After 45 min at 0°C, cells were 
sonicated with two 30 sec-bursts. Octyl-p-D-glucopyranoside was then added to a 
final concentration of 0.1% and the resulting lysate was clarified by centrifugation at 
30,000 g for 30 min at 4°C. Cleared lysates were incubated for 1 h at 4°C with 500 pi 
of a 1:1 slurry of glutathione-agarose beads (Sigma) that had been washed times 
and resuspended in lysis buffer. The beads were poured into a small column and 
washed extensively with lysis buffer containing 0.1% octyl-p-D-glucopyranoside. The 
bound fusion protein was eluted with 2 ml of 15 mM glutathione (reduced) in lysis 
buffer. Typical yields were 0.5-1 mg of purified GST-HD20, -HD30 and -HD51 
proteins per 200 ml of bacterial culture; yields of GST-HD83 and -HD122 were much 
lower, less than 10% of that obtained with .the shorter fusion proteins. Protein was 
determined by the Bio-Rad dye binding assay using bovine serum albumin as 
standard. SDS-PAGE of the purified GST-HD20, -HD30, -HD51, -HD83 and -HD122 
proteins revealed major bands of 42, 45, 50, 65 and 75 kDa, respectively (Fig. 1a). 
These bands were also detected when the various protein fractions were subjected 
to immunoblot analysis using the affinity purified anti-huntingtin antibody HD1 (Fig. 
1b, lanes 2-6). HD1 specifically detects the GST-HD fusion proteins on immunoblots, 
whereas the GST-tag alone is not recognized (Fig. 1b f lane 1). For immunoblotting a 
bacterial plasmid encoding HD1-His, a Hise-tagged fusion protein containing 
residues 1-222 of huntingtin, was generated by inserting a PCR-ampIified IT-15 
cDNA fragment into the pQE-32 vector (Qiagen). The fusion protein was expressed 
in E. co//, affinity-purified under denaturating conditions on Ni-NTA agarose, and 
injected into rabbits. The resulting immune serum was then affinity-purified against 
the antigen that had been immobilized on Ni-NTA agarose. The GAPDH- and Fos B- 
specific antisera have been described (Wanker et al., 1997; Davies et al., 1997). 

Western blotting was performed as detailed (Towbin et al., 1979). The blots 
were incubated with 1:1000 dilutions of the indicated primary antibody, followed by 
an alkaline-phosphatase-conjugated secondary antibody. Color development was 
carried out with 5-bromo-4-chioro-3-indolyl phosphate and nitroblue tetrazolium as 
substrates (Promega). 

All recombinant proteins migrated at a size corresponding nearly to that 
predicted from their amino acid sequence. Interestingly, an additional high molecular 
weight band which remains at the top of the gel, was consistently detected in the 
protein fractions with the longest polyglns (83 and 122 residues; Fig. 1a and b, lane 
5 and 6). This band was most prominent on the immunoblots but was also clearly 



detectable in the Commassie stained gel. This immunoreactive material was often 
still present at the bottom of the loading slots, even after the samples had been 
boiled for 5 min in the presence of 2% SDS and 6 M urea prior to loading. 

Example 2: 

Proteolytic cleavage of GST-HD fusion proteins containing expanded polyglns 

It has been shown previously that the solubility of certain proteins can be 
enhanced by the addition of the GST-tag (Smith and Johnson, 1988) and it was 
therefore of interest to determine whether the removal of the GST-tag by proteolytic 
cleavage would have an effect on the solubility of the polygln-containing fusion 
proteins. Potential factor Xa and trypsin cleavage sites within the GST-HD fusion 
proteins are shown in Fig. 2. Factor Xa cleaves between the GST-tag and the HD 
exon 1 protein whereas trypsin removes an additional 15 amino acids from the N- 
terminus and a single proline from the C-terminus, both proteases leaving the polygln 
repeat intact. The GST-HD20, -HD30 and -HD51 proteins were digested with trypsin 
under conditions designed to remove the GST-tag from the fusion protein without it 
being totally degraded. After cleavage, proteins were denatured by boiling in the 
presence of 2% SDS and analyzed by SDS-PAGE and immunoblotting using the 
anti-HD1 antibody. GST-HD20 and -HD30 cleavage yielded products migrating in a 
12.5 % gel at approximately 30 and 33 kDa, respectively. In contrast, cleavage of 
GST-HD51 resulted in the formation of two protein products migrating, at 
approximately 37 and 60 kDa, and an additional weak immunoreactive band on the 
bottom of the loading slots was also detected (Fig. 3a). This high molecular weight 
band was more pronounced when GST-HD51 was digested with trypsin under 
conditions in which the GST-tag was totally degraded (Fig. 3b). However, with 
proteins GST-HD20 and -HD30 this longer exposure to trypsin produced the same 
cleavage products as the ones seen in Fig. 3a and the high molecular weight 
products were not observed. Similar results were obtained with factor Xa protease 
and endoproteinases Arg-C and Lys-C. As regards the proteolytic cleavages, the 
following protocols were carried out: The GST-HD fusion proteins purified as 
described above were dialysed against 40 mM Tris-HCI (pH 8.0), 150 mM NaCI, 0.1 
mM EDTA and 5% (v/v) glycerol to raise the pH prior to proteolytic cleavage. The 
proteins were then combined with bovine factor Xa (New England Biolabs) or 
modified trypsin (Boehringer Mannheim, sequencing grade) in dialysis buffer 
containing 2 mM CaCl2 at an enzymersubstrate ratio of 1:10 (w/w) or 1:40 (w/w), 
respectively. Incubations with factor Xa were at 25°C for 16 h. Tryptic digestions 
were performed at 37°C for 3 or 16 h as indicated. Digestions were terminated by the 
addition of PMSF to 1 mM . The degree of proteolysis was determined by SDS- 



PAGE followed by staining with Coomassie blue or immunoblottting using anti-HD1 
antibody. 

We have developed a simple and sensitive filter assay to detect the formation 
of high molecular weight insoluble protein aggregates. This assay is based on the 
finding that the SDS-insoluble protein aggregates obtained by proteolytic cleavage of 
GST-HD51 are retained on a cellulose acetate filter, whereas the soluble cleavage 
products of GST-HD20 and GST-HD30 are not. Factor Xa or trypsin digestions of 
purified GST-HD fusion proteins (10 pg) were performed in a 20 pi reaction mixture 
as described above. Reactions were terminated by adjusting the mixture to 2% SDS 
and 50 mM DTT. After heating at 100°C for 5 min, aliqouts (0.5 pi) were diluted into 
200 pi of 0.1% SDS and filtered through a cellulose acetate membrane (Schleicher & 
Schuell, 0.2 pm pore size) using a BRL dot blot filtration unit. Filters were washed 
with water, and the SDS-insoluble aggregates retained on the filter detected by 
incubation with the anti-HD1 antibody, followed by an anti-rabbit secondary antibody 
conjugated to alkaline phosphatase (Boehringer Mannheim). Fig. 3c shows 
immunoblots of cellulose acetate and nitrocellulose membranes to which the native 
GST-HD20, -HD30 and -HD51 proteins and their factor Xa and trypsin cleavage 
products have been applied. On the cellulose acetate filter, only the cleavage 
products of GST-HD51 were detected by the anti-HD1 antibody, indicating the 
formation of insoluble high molecular weight protein aggregates. In contrast, all the 
uncleaved GST-HD fusion proteins and their digestion products were detected on the 
nitrocellulose control filter. This assay was also used to detect huntingtin aggregates 
present in a nuclear fraction from the brain of an R6/2 hemizygous mouse and 
littermate control (see preparation of nuclei below). 

Example 3: 

Huntingtin proteins containing expanded polyglns in the pathological range 
aggregate to amyloid-like birefringent fibrils 

Electron microscopy of negatively stained GST-HD51 fractions showed 
oligomeric particles with diameters of 6 to 7 nm (Fig. 4a); no higher ordered 
aggregates were observed. For electron microscopic observation, the native or 
protease-digested GST-HD fusion proteins were adjusted to a final concentration of 
50 pg/ml in 40 mM Tris-HCI (pH 8.0), 150 mM NaCI, 0.1 mM EDTA and 5% glycerol. 
Samples were negatively stained with 1% uranyl acetate and viewed in a Philips 
CM100 EM. In contrast, protein fractions obtained by proteolytic cleavage of GST- 
HD51 showed numerous clusters of high molecular weight fibrils and ribbon-like 
structures (Fig. 4b, c, d and e), reminiscent of purified amyloids (Prusiner et al., 



1983). The fibrils obtained after digestion with factor Xa showed a diameter of 10-12 
nm and their length varied from 100 nm up to several micrometers (Fig. 4b and c). In 
the trypsin-treated samples ribbon-like structures formed by lateral aggregation of 
fibrils with a diameter of 7.7 nm were observed (Fig. 4d and e). After treatment with 
factor Xa or limited digestion with trypsin, clots of small particles were frequently 
detected on one or both ends of the fibrils (Fig. 4b, c and d). These clots of varying 
sizes and shapes were not seen when GST-HD51 was digested with trypsin under 
conditions in which the GST-tag is totally degraded (Fig. 4e), indicating that they 
contain GST. In strong contrast to GST-HD51, the GST-HD20 and -HD30 proteins 
did not show any tendency to form ordered high molecular weight structures, either 
with or without protease treatment (Fig. 4f). 

The insoluble protein aggregates formed by proteolytic cleavage of GST-HD51 
were isolated by centrifugation and stained with Congo red (Caputo et al., 1992) and 
examined under a light microscope. For light microscopy, peptide aggregates formed 
by trypsin digestion of purified GST-HD fusion proteins (50 pg in 100 pi of digestion 
buffer) were collected by centrifugation at 30,000 g for 1 h and resuspended in 10 ; pi 
of water. Samples were mixed with 0.1 volume of a 2% (w/v) aqueous Congo Red 
(Sigma) solution, placed on aminoalkylsilane-coated glass slides, and allowed to dry 
overnight under a coverslip. After removing the coverslip, excess Congo Red was 
removed by washing with 90% ethanol. Evaluation of the Congo Red staining by 
polarization microscopy was performed using a Zeiss Axiolab Pol microscope 
equipped with strain-free lenses and optimally aligned cross-polarizers. After 
staining, the protein aggregates on the glass slides were red, indicating that they had 
bound the dye (Fig. 5a), and when examined under polarized light a green color and 
birefringence were detected (Fig. 5b and c). These staining characteristics were 
similar to those observed for prions (Prusiner et al., 1983) and amyloids (Caputo et 
al., 1992). 

Example 4: 

Huntingtin proteins containing expanded polyglns form amyloid-like protein 
aggregates in vivo 

To determine whether the amyloid-like protein aggregates formed by proteolytic 
cleavage of GST-HD51 in vitro are also present in vivo, nuclear protein fractions of 
brain and kidney were prepared from mice transgenic for the HD mutation (line 
R6/2) and littermate controls (Davies et al., 1997; Mangiarini et al., 1996). Nuclei 
from the brain or kidney of an R6/2 hemizygous mouse with a repeat expansion of 
(CAG)i43 (Mangiarini et al., 1996) at ten weeks of age and littermate control were 
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prepared as follows. Whole brain samples (80 mg) in 400 ml of 0.25 M sucrose in 
buffer A (50 mM triethanolamine [pH 7.5], 25 mM KCI, 5 mM MgCl2, 0.5 mM DTT, 
0.5 mM PMSF) were homogenized using 15 strokes of a tight-fitting glass 
homogenizes The homogenate was adjusted to a final concentration of 5 mM DTT, 
and centrifuged at 800 g for 15 min. The supernatant was recentrifuged at 100,000 g 
for 1 h, and the supernatant from this centrifugation was taken as the cytosolic 
fraction (fraction C). The loose pellet from the first centrifugation was homogenized, 
diluted to 1.2 ml with 0.25 M sucrose/buffer A, and mixed with two volumes of 2.3 M 
sucrose/buffer A. The mixture was then layered on top of 0.6 ml 2.3 M 
sucrose/bufferA in a SW60 tube and centrifuged at 124,000 g for 1 h. The pellet was 
harvested with a spatula, resuspended in 200 pi of 0.25 M sucrose/buffer A and 
again centrifuged at 800 g for 15 min. The entire procedure was carried out at 4 °C. 
The pelleted nuclei were resuspended to a density of - 1 x 10 7 nuclei/ml in 0.25 
sucrose/buffer A (fraction N) and stored at -80 °C. Nuclei from mouse kidney were 
prepared in the same way. The protein extracts were analyzed by SDS-PAGE and 
Western blotting using the anti-HD1 antibody (Fig. 6a). Strikingly, this antibody 
detected a prominent high molecular weight band in the nuclear fraction (N) prepared 
from R6/2 transgenic brain, very similar to the high molecular weight band obtained 
by proteolytic cleavage of GST-HD51 (Fig. 3b). No such immunoreactive band was 
detected in the nuclear fraction of brain from the littermate control and it was also 
absent from the corresponding cytoplasmic fractions (C). A small amount of high 
molecular weight material was also detected in the nuclear fraction prepared from 
R6/2 transgenic kidney, but was again absent from the cytoplasmic fraction. The 
purity of the nuclear and cytoplasmic fractions was confirmed by Western blot 
analysis using the anti-Fos B and anti-GAPDH antibodies. Anti-Fos B detected the 
transcription factor mainly in the nuclear fraction, and the enzyme GAPDH was only 
seen in the cytoplasmic fraction, as expected. The Western blot results were 
reproduced using the cellulose acetate filter assay (Fig. 6b). Using this assay, a 10- 
20 fold higher amount of transgene protein was detected in the nuclear fraction 
isolated from brain material, compared to that prepared from kidney. 

The formation of Nils has been shown to preceed the neuronal dysfunction that 
forms the basis 'of the progressive neurological phenotype observed in the R6 
transgenic lines (Davies et al., 1997). These Nils are immunoreactive for both 
huntingtin and ubiquitin antibodies and contain the transgene but not the 
endogenous huntingtin protein. Therefore, Western blot analysis using an anti- 
ubiquitin antibody was also performed showing the same pattern of immunoreactivity 
as had been observed with the anti-HD1 antibody (Fig. 6a), and indicating that the 



high molecular weight transgene protein present in the nuclear fraction is 
ubiquitinated (data not shown). 

To examine whether the Nils containing the proteins huntingtin and ubiquitin 
(Davies et al. t 1997) have a fibrous composition, an ultrastructural analysis was 
performed. Experimentally, a 17 month old R6/5 homozygous mouse ((CAG)i28- 
155) (Mangiarini et al., 1996) was deeply anaesthetised with sodium pentobarbitone 
and then perfused through the left cardiac ventricle with 35-50 ml of 4 % 
paraformaldehyde and either 0.5 % glutaraldehyde in 0.1 M Millonig's phosphate 
buffer (pH 7.4). The brain was removed from the skull and placed in fresh fixative 
overnight at 4 °C. Coronal sections (50 - 200 pm) were cut on an Oxford Vibratome 
(Lancer) and collected in serial order in 0.1 M phosphate buffer. After being 
osmicated (30 min in 1% OSO4 in 0.1 M phosphate buffer) the sections were stained 
for 15 min in 0.1 % uranyl acetate in sodium acetate buffer at 4 °C, dehydrated in 
ethanols, cleared in propylene oxide and embedded in Araldite between two sheets 
of Melanex (ICI). Semi thin (1 pm) sections" were cut with glass knives and stained 
with toluidine blue adjacent to thin sections cut with a diamond knife on a Reichert 
Ultracut ultramicrotome. The sections were collected on mesh grids coated with a 
thin formvarfilm, counterstained with lead citrate and viewed in a Jeol 1010 electron 
microscope. An electron micrograph of a Nil from a 17 month old R6/5 homozygous 
mouse is shown in Fig. 6c. This Nil (large arrow) contains high molecular weight 
fibrous structures which were clearly differentiated from the surrounding chromatin. 
The filaments were randomly oriented, 5-10 nm in diameter and often measured, up 
to 250 nm in length (sma'll arrows). These structures differ from those previously 
reported in the Nils seen in hemizygous R6/2 mice which were far more granular in 
composition, with individual filamentous structures being more difficult to distinguish 
(Davies et al., 1997). R6/2 mice exhibit an earlier age of onset with a more rapid 
progression of the phenotype and do not survive beyond 13 weeks (Mangiarini et al., 
1996). It is possible that the filamentous structures do not have time to form in the 
R6/2 mice. 
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CLAIMS 



1 . A composition comprising 

(a) a nucleic acid molecule encoding a fusion protein comprising 

(aa) a (poly)peptide that enhances solubility and/or prevents aggregation of 
said fusion protein; and 

(ab) an amyloidogenic (poly)peptide that has the ability to self-assemble into 
amyloid-like fibrils or protein aggregates; 

(b) a vector containing the nucleic acid molecule of (a); 

(c) a host transformed with the vector of (b); 

(d) a fusion protein encoded by the nucleic acid of (a) or a functional 
derivative thereof; and/or 

(e) an antibody specific for the fusion protein of (d). 

2. The composition of claim 1 wherein the amyloidogenic (poly)peptide comprises 
a polyglutamine expansion. 

3. The composition of claim 2 wherein said polyglutamine expansion comprises at 
least 35 glutamines. 

4. The composition of claim 3 wherein said polyglutamine expansion comprises at 
least 51 glutamines. * 

5. The composition of any one of claims 2 to 4 wherein said (poly)peptide defined 
in (ab) is huntingtin, androgen receptor, atropin, TATA binding protein, or 
ataxin-1 ,-2,-3 or -6 or a fragment or derivative thereof. 

6. The composition of any one of claims 1 to 5 wherein said amyloidogenic 
(poly)peptide self-assembles subsequent to release from said fusion protein. 

7. The composition of claim 1 wherein said amyloidogenic (poly)peptide is the 
amyloid precursor protein (APP), p-protein, an immunoglobulin light chain, 
serum amyloid A, transthyretin, cystatin C, p2-microglobulin, apolipoprotein A-1, 
gelsoline, islet amyloid polypeptide (IAPP), calcitonin, a prion, atrial natriuretic 
factor (ANF), lysozyme, insulin; fibrinogen, or a-synuclein or a fragment or 
derivative thereof. 



8. The composition of any one of claims 1 to 7 wherein said (poly)peptide defined 
in (aa) is glutathione S-transferase (GST), intein, thioredoxin, dihydrofolate 
reductase (DHFR) or chymotrypsin inhibitor 2 (CI2) or a functional fragment or 
derivative thereof. 

9. The composition of any one of claims 1 to 8 wherein said nucleic acid is DNA. 

10. The composition of any one of claim 1 to 9 wherein said vector is an expression 
vector or a gene targeting vector. 

11. The composition of any one of claims 1 to 10 wherein said host is a bacterial, 
preferably an E.coli, an animal-, preferably a mammalian, an insect-, a plant-, a 
fungal, preferably a yeast- and most preferably a Saccharomyces or Aspergillus 
cell, a Pichia pastoris cell, a transgenic animal or a transgenic plant. 

12. A method of producing a fusion protein as defined in the composition of any of 
the preceding claims comprising culturing or raising the host as defined in claim 
1 1 and isolating said fusion protein. 

13. The composition of any one of claims 1 to 12 wherein said antibody is a 
monoclonal antibody, polyclonal antibody, phage display antibody or a fragment 
or derivative thereof. 

14. An in vitro method of producing amyloid aggregates comprising 

(a) at least partially cleaving the fusion protein comprised in the composition 
of any one of claims 1 to 13 wherein the (poly)peptide that is released has 
the ability to self-assemble into amyloid-like fibrils or protein aggregates; 
or 

(b) inducing self-assembly into amyloid-like fibrils or protein aggregates by 
heating the fusion protein* comprised in the composition of any one of 
claims 1 to 13 or an amyloidogenic (poly)peptide that has the ability to 
self-assemble into amyloid-like fibrils or protein aggregates. 

15. The method of claim 14 wherein said cleavage is effected chemically or 
enzymatically, or by the intein self-cleavage reaction in the presence of thiols. 

16. A method of testing a prospective inhibitor of aggregate formation of a fusion 
protein as defined in the composition of any one of claims 1 to 13 when 
enzymatically or chemically cleaved or a non-cleaved fusion amyloidogenic 



(poly)peptide as defined in any one of claims 1 to 13 or an amyloidogenic non- 
fusion (poly)peptide comprising 

(a) incubating in the presence of a prospective inhibitor 

(aa) said fusion protein in the presence or absence of a cleaving agent; or 

(ab) said non-fusion poly(peptide); and 

(b) assessing the formation of amyloid-like fibrils or protein aggregates. 

17. The method of claim 16 wherein incubation is effected in the presence of factor 
Xa, trypsin, endoproteinase Arg-C, endoproteinase Lys-C, proteinase K or 
elastase at a temperature of preferably 25 to 37°C for 0,5 to 16 hours and the 
assessment of the formation of fibrils or aggregates in step (b) is preferably 
effected by a filter assay or by a thioflavine T (ThT) fluorescence assay, in 
which the fluorescence intensity reflects the degree of aggregation. 

18. A method for identifying an inhibitor of aggregate formation of a fusion protein 
as defined in any one of claims 2 to 6 prior to or after proteolytic or chemical 
cleavage or of a non-fusion amyloidogenic (poly)peptide that has the ability, to 
self-assemble into amyloid-like fibrils or protein aggregates comprising 

(a) loading a surface or gel with said protein or an aggregate thereof; 

(b) incubating said surface or gel with a prospective inhibitor; and 

(c) assessing whether the presence of said prospective inhibitor avoids or 
reduces aggregate formation or further aggregate formation. 

19. The method of claim 18 wherein said surface is a membrane. 

20. An inhibitor identifiable or identified by the method of any one of claims 16 to 
19 or a derivative thereof. 

21. The inhibitor of claim 20 which is an antibody, 4'-lodo-4'-deoxydoxorubicin 
(IDOX), pyronine Y, guanidine hydrochloride, urea, rifampicin or a derivative 
thereof, myristyltrimethylammonium bromide, hydroquinone, p-benzoquinone, 
1 ,4-dihydroxynaphthalene, p-methoxyphenol, a-tocopherol ( ascorbic acid, p- 
carotene, * anthracycline, doxorubicin, hexadecyl-N-methylpiperidinium, 
dodecyltrimethyl-ammonium, N-tetradecyl-N f N-dimethyl-3-ammonio-1- 
propanesulfonate, a (poly)peptide, glutamine or an oligoglutamine peptide. 

22. A pharmaceutical composition comprising the inhibitor of claim 20 or 21 and a 
pharmaceutical^ acceptable carrier and/or diluent. 



A transgenic mammal or plant comprising a nucleic acid molecule or vector as 
described in the composition of claim 3 or 4. 
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ABSTRACT 



The present invention relates to novel compositions useful for elucidating the onset 
I plres of diseases of preferably neuronal origin associated with the forma on of 
IvtoWke fibrils or protein aggregates. Further, the present invent™ relates to 
memods or monitoring said formation as we,, as ,o methods for identify^ nnh,b..ors 
tfsaid formation. Additionally, the invention relates to inhibitors MM by the 
method of the invention as we,, as to pharmaceutical compositions compns.ng sa,d 
inhibitors. 



